ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites). To use ArchiveBot, drop by #archivebot on EFNet. To interact with ArchiveBot, you issue commands by typing it into the channel. Note you will need channel...
Topics: archiveteam, archivebot, webcrawl, robot, love
94M
94M
Nov 7, 2020
11/20
by
Archive Team
587.8M
588M
Jan 21, 2016
01/16
by
Archive Team
Archive Team now searches many, many news sites, including extensive worldwide and obscure sources, to capture unique news stories for history.
123.5M
123M
Oct 14, 2016
10/16
by
Archive Team
Google has been planning to shut down panoramic photo sharing site Panoramio since September 2014. The initial plan was to merge it with Google Views which was a similar product. However, due to feedback from the Panoramio community they held off that move. Frank did an in depth post about this in June 2015. Since then Google Views itself was merged into Street View. Google has now announced that they are finally shutting down Panoramio for good. As of November 4th, 2016, they will stop...
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
62.4M
62M
Jan 9, 2018
01/18
by
Archive Team
Remember that one time Nintendo tried to build its own social network? Unless you had a Wii U, you probably don't -- but it was called Miiverse, and it was weird, awkward and kind of wonderful. The odd social platform let users take screenshots in most Wii U and 3DS games (a feature otherwise missing from many of Nintendo's consoles, yet common on other platforms), chat about games with other users and draw fan art. But it wasn't long for this world. Late last year, Nintendo shuttered Miiverse...
54.5M
54M
Dec 23, 2013
12/13
by
Archive Team
Wretch (Chinese: 無名小站; pinyin: wúmíng xiǎo zhàn) is a Taiwanese community web site; in Chinese, its name means Nameless Little Site. It is the most well-known blog community in Taiwan with thousands of users registered. Wretch provides free photo album, and blog hosting services. Four languages, including English, are available. A more extensive VIP version is offered. It is the top visited site in Traditional Chinese languages and the second in Taiwan after Yahoo Taiwan according...
10.5M
10M
Jan 11, 2021
01/21
by
Archive Team
Results of the Archive Team Ne Parle Pas Project
64M
64M
Nov 3, 2015
11/15
by
Archive Team
A collection of various Wikis throughout the internet, gathered as a last resting spot.
17.8M
18M
Dec 8, 2018
12/18
by
Archive Team
TUMBLR is a blogging site that was later purchased by Verizon/Oath and experienced a transient removal from the Apple Store over lack of policing of its content; Tumblrs response was to build an automated foot-shooting machine and commit online suicide. This is a general scale archiving of tumblr blogs marked for deletion.
A collection of news articles grabbed from a wide variety of sources around the world automatically by Archive Team scripts.
20.6M
21M
Jun 23, 2018
06/18
by
Archive Team
The mirroring of Youtube involves collections of high-contention or representative videos and providing a more permanent home for these videos. They are meant for historical records and research. Mirrored videos in this collection are not included in general search engine results.
30.8M
31M
Feb 23, 2012
02/12
by
Archive Team
The hardest part about our transient, shallow world wide web is the terrifying swiftness in which data disappears. To this end, Archive Team members have often bravely strapped on miner's helmets and flashlights, dove into the flaming wreckage of a dying site, and grabbed a copy for all of time. Some of these rescues, consisting of what we could grab, are being saved here. Please Note: Some of these items were not burning as brightly or recently as others - they might be merely considered...
3.2M
3.2M
Oct 27, 2020
10/20
by
Archive Team
24.8M
25M
Mar 13, 2013
03/13
by
Archive Team
Xanga /ˈzæŋɡə/ is a website that hosts weblogs, photoblogs, and social networking profiles. It is operated by Xanga.com, Inc., based in New York City. In September of 2013 Xanga relaunched under the assumed name of Xanga 2.0. Xanga/Xanga 2.0 is no longer a free blogging webspace. Users will now have to pay an annual fee of $48.00. The intellectual property of many users has since been lost. Xanga only saved archives from users that posted in the last five years (2008-2013). On their...
Topics: Xanga, Doomed, Archive Team
13.7M
14M
Sep 15, 2017
09/17
by
Archive Team
This item contains regular captures of Dutch news websites in screenshot and WARC format. Dit item bevat de homepages van Nederlandse nieuwswebsites als screenshot en in WARC-formaat.
697,898
698K
Aug 15, 2020
08/20
by
Archive Team
A "Freeze Frame" of the WEBSHOTS Website, on the occasion of it announcing deletion.
92,702
93K
web
eye 92,702
favorite 0
comment 0
1.3M
1.3M
Sep 28, 2020
09/20
by
Archive Team
10.8M
11M
Nov 20, 2013
11/13
by
Archive Team
Hyves is a small social networking site in the Netherlands with mainly Dutch visitors and members, where it competes with sites such as Facebook and MySpace. Hyves was founded in 2004 by Raymond Spanjar and Floris Rost van Tonningen. The service is available in both Dutch and English. In May 2010 Hyves had more than 10.3 million accounts. These correspond to two thirds of the size of the Dutch population (which stands at over 16 million in 2010), however these include multiple accounts per...
1.2M
1.2M
Dec 29, 2020
12/20
by
Archive Team
A collection of news articles grabbed from a wide variety of sources around the world automatically by Archive Team scripts.
1.8M
1.8M
Dec 14, 2019
12/19
by
Archive Team
An Archive Team group grab of the game streaming site PLAYS.TV. "Plays.tv is one of the best ways to record, review, and share your gameplays. The app has been known to be In fact, it is one of the most popular recording softwares used for League of Legends. Unfortunately, everything good must come to an end. The developers of Plays.tv announced that the site is closing down. They will also discontinue support for the website and desktop application on December 15."
375,379
375K
Feb 3, 2021
02/21
by
Archive Team
1.2M
1.2M
Aug 13, 2018
08/18
by
Archive Team
This collection is a set of Github repository archives from two major sets: A panic grab upon the acquisition by Microsoft, and a larger, ongoing set of Pretty Much Everything.
5.9M
5.9M
Jan 2, 2016
01/16
by
Archive Team
When we started the Google Code project hosting service in 2006, the world of project hosting was limited. We were worried about reliability and stagnation, so we took action by giving the open source community another option to choose from. Since then, we’ve seen a wide variety of better project hosting services such as GitHub and Bitbucket bloom. Many projects moved away from Google Code to those other systems. To meet developers where they are, we ourselves migrated nearly a thousand of...
2.8M
2.8M
Apr 11, 2016
04/16
by
Archive Team
"Estimados miembros, os informamos que Fotolog estará inaccesible de forma permanente en las próximas semanas. El objetivo de esta comunicación es que podáis recuperar todos vuestros datos e informaciones lo antes posible, y en cualquier caso antes del 20 de Febrero del 2016. Esperamos que podáis continuar con vuestros blogs y compartir vuestras fotos en otras plataformas. Por favor, haced circular esta información a todos los demás miembros de la comunidad".
4.3M
4.3M
Apr 19, 2015
04/15
by
Archive Team
This is a grab of FurAffinity.net, a site hosting art of interest to the furry fandom. The grab is from 2015. See http://archiveteam.org/index.php?title=FurAffinity for more details.
35.7M
36M
Jun 10, 2015
06/15
by
Archive Team
POMF is a sound effect and onomatopoeia describing the sound someone makes as they fall onto a bed or a similar surface. It is commonly described through the symbolia =3 and combined with the catchphrase “What are we gonna do on the bed?”. On the internet, the term has gained usage in both verbal and image variations and is often used as an exploitable.
5M
5.0M
Apr 6, 2016
04/16
by
Archive Team
The ArchiveTeam Videobot is an automated collector of relevant and historic video items, making them playable and preserved on the Internet Archive.
3.5M
3.5M
Apr 12, 2016
04/16
by
Archive Team
MusicBrainz is a project that aims to create an open content music database. Similar to the freedb project, it was founded in response to the restrictions placed on the CDDB. However, MusicBrainz has expanded its goals to reach beyond a compact disc metadata storehouse to become a structured open online database for music. MusicBrainz captures information about artists, their recorded works, and the relationships between them. Recorded works entries capture at a minimum the album title, track...
3.6M
3.6M
Mar 31, 2013
03/13
by
Archive Team
Formspring was a question-and-answer-based social networking service for social conversations operated by Formspring.me, Inc. It was launched on 1 November 2009 by Ade Olonoh, the founder of online form builder Formstack. In May 2013, Spring.me acquired the assets of Formspring. The rebranded website was officially launched in beta in September 2013 and launched publicly in November 2013. Formspring was launched in Indianapolis in November 2009 by the founder of online form builder Formstack,...
5.4M
5.4M
Mar 16, 2013
03/13
by
Archive Team
January 3rd, 2013 Dear Punchfork Community, Today we are excited to share the news that Pinterest has acquired Punchfork! Since launching in January 2011, our mission at Punchfork has been to help home cooks discover new, high quality recipes and share them with family and friends. It is a mission driven by a belief in the ability of web and mobile platforms to inspire our lives offline--at home, in our communities, and for Punchfork, wherever meals are shared. To cooking aficionados, Pinterest...
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
2.9M
2.9M
Apr 11, 2016
04/16
by
Archive Team
2M
2.0M
Nov 11, 2018
11/18
by
Archive Team
Thanks to Hiroi for leading this mirroring project. Prior to the takeover by Yahoo! , GeoCities had a Japanese subsidiary, GeoCities Japan. GeoCities Japan was headquartered in the Nihonbashi Hakozaki Building in the Nihonbashi area of Chūō, Tokyo . As of February 10, 2016, GeoCities Japan was still online, with no signs of upcoming closure. Its member sites are still accessible, and it is still accepting new account registrations, but now all services...
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
A collection of news articles grabbed from a wide variety of sources around the world automatically by Archive Team scripts.
2.1M
2.1M
Aug 8, 2016
08/16
by
Archive Team
Archive Team: The Orkut Cut-up
518,452
518K
Aug 11, 2014
08/14
by
Archive Team
witch (also and formerly known as Twitch.tv) is a live streaming video platform; introduced in June 2011 as a spin-off of fellow streaming platform Justin.tv, the site primarily focuses on video gaming, including playthroughs of video games by users, along with broadcasts of e-sports competitions. Content on the site can either be viewed live, or viewed on an on-demand basis. The popularity of Twitch would eclipse that of its general-interest counterpart; by mid-2013, the website had amassed an...
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
400,012
400K
Sep 27, 2020
09/20
by
Archive Team
A love story doomed in time - user creation by Naver Matome, and the eternal, grinding loss of the Web.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
324,087
324K
Sep 28, 2020
09/20
by
Archive Team
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
1.5M
1.5M
Jun 4, 2018
06/18
by
Archive Team
ZetaBoards (formally InvisionFree) is a forum host that offers both paid and free-with-ads forums to anyone. It claims to have been "used by millions of people looking for a place to gather, discuss and share.". Initial checks show that over 75,000 boards are listed on their "Featured Board Index" page on their site, showing the scale of this project. ZetaBoards is owned by Zathyus Networks, Inc., which also owns zIFBoards. Aside from a few hiccups, there was no huge...
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
704,013
704K
Aug 1, 2019
08/19
by
Archive Team
894,785
895K
Mar 8, 2019
03/19
by
Archive Team
An archive team collection of Google Plus postings and communities, gathered in the 2019 bonfire of the ill-fated social network's repurposing.
1.3M
1.3M
Feb 16, 2017
02/17
by
Archive Team
The Internet Movie Database (IMDb) is shutting down its message boards, the organisation has announced. In a statement on its website, the IMDb said it had “concluded that IMDb’s message boards are no longer providing a positive, useful experience for the vast majority of our more than 250 million monthly users worldwide”, and that the decision was “based on data and traffic”. More specifically, the company – which was set up in 1990 by Bristol-based IT worker Col Needham and later...
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
1.3M
1.3M
Sep 16, 2017
09/17
by
Archive Team
This is a grab of fanfiction.net, a prominent host for fiction set in existing fictional universes. The grab is from 2012. Most of the items are WARCs, suitable for use in the Wayback Machine -- but Fanfiction.net Safety Download is a single 2 GB tar file containing epub files, which may be easier to extract. See http://archiveteam.org/index.php?title=FanFiction.Net for other grabs of the site.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.